LLM Evaluation Metrics Dashboard Dummy

Metrics Distribution

95% confidence interval for all the metrics
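
The dashboard does not state how the 95% confidence intervals are computed. One common approach is a percentile bootstrap over the per-question scores; the sketch below assumes that method. The function name bootstrap_ci and its parameters are hypothetical and not taken from the dashboard's code.

```python
import random
from statistics import mean


def bootstrap_ci(scores, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for the mean of `scores`.

    Hypothetical helper: the dashboard's actual method is not documented.
    Returns (mean, lower_bound, upper_bound).
    """
    if not scores:
        raise ValueError("scores must not be empty")
    rng = random.Random(seed)  # fixed seed keeps the result reproducible
    n = len(scores)
    # Resample with replacement and record the mean of each resample.
    resampled_means = sorted(
        mean(rng.choices(scores, k=n)) for _ in range(n_resamples)
    )
    lower_idx = int((alpha / 2) * n_resamples)
    upper_idx = int((1 - alpha / 2) * n_resamples) - 1
    return mean(scores), resampled_means[lower_idx], resampled_means[upper_idx]
```

With a single evaluated question, as in the tables on this page, every resample is identical and the interval collapses to the score itself, so the interval only becomes informative with more questions.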

Detailed Explanations

correctness Details

Question | Answer | Golden Answer | Reason | Score
Is Neo4j supported by cognee? | Yes, Neo4j is supported by cognee. | Yes | The actual output directly confirms the expected output without any contradictions or omissions. It provides a complete and accurate response to the input question, aligning perfectly with the evaluation steps. | 0.9691841146183957

EM Details

Question | Answer | Golden Answer | Reason | Score
Is Neo4j supported by cognee? | Yes, Neo4j is supported by cognee. | Yes | Not an exact match | 0.0
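
The EM score of 0.0 follows from comparing the full answer string against the golden answer. A minimal sketch of an exact-match check is shown below, assuming SQuAD-style normalization (lowercasing, stripping punctuation and English articles); the dashboard does not say which normalization it applies, and the function names are hypothetical.

```python
import re
import string


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and articles, collapse whitespace.

    Assumed normalization; the dashboard's own rules are not documented.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(answer: str, golden: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return 1.0 if normalize(answer) == normalize(golden) else 0.0


# The row above: a full-sentence answer versus a one-word golden answer.
score = exact_match("Yes, Neo4j is supported by cognee.", "Yes")
```

Under this normalization the answer would need to be just "Yes" (in any casing, with or without punctuation) to score 1.0, which is why a correct but more verbose answer still receives 0.0.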

f1 Details

Question | Answer | Golden Answer | Reason | Score
Is Neo4j supported by cognee? | Yes, Neo4j is supported by cognee. | Yes | F1: 0.29 (Precision: 0.17, Recall: 1.00) | 0.2857142857142857
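
The reported precision, recall, and F1 are consistent with a token-overlap F1: the answer has six tokens, one of which ("yes") appears in the golden answer, giving precision 1/6, recall 1/1, and F1 = 2/7. The sketch below reproduces those numbers, assuming lowercased word tokens with punctuation removed; the tokenizer and the function name token_f1 are assumptions, not the dashboard's actual code.

```python
import re
from collections import Counter


def token_f1(answer: str, golden: str):
    """Token-overlap precision, recall, and F1 between answer and golden.

    Assumed tokenization: lowercase, split on runs of word characters.
    Returns (precision, recall, f1).
    """
    answer_tokens = re.findall(r"\w+", answer.lower())
    golden_tokens = re.findall(r"\w+", golden.lower())
    # Multiset intersection counts each shared token at most as often
    # as it occurs in both strings.
    overlap = sum((Counter(answer_tokens) & Counter(golden_tokens)).values())
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / len(answer_tokens)
    recall = overlap / len(golden_tokens)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


precision, recall, f1 = token_f1("Yes, Neo4j is supported by cognee.", "Yes")
print(f"F1: {f1:.2f} (Precision: {precision:.2f}, Recall: {recall:.2f})")
# prints "F1: 0.29 (Precision: 0.17, Recall: 1.00)"
```

The low F1 reflects answer length rather than correctness: every extra token in a correct answer lowers precision when the golden answer is a single word.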

contextual_relevancy Details

Question | Answer | Golden Answer | Reason | Score
Is Neo4j supported by cognee? | Yes, Neo4j is supported by cognee. | Yes | The score is 1.00 because the statement 'Neo4j is a graph database supported by cognee' directly answers the input question, showing perfect relevancy. Great job! | 1.0

context_coverage Details

Question | Answer | Golden Answer | Reason | Score
Is Neo4j supported by cognee? | Yes, Neo4j is supported by cognee. | Yes | The score is 0.60 because the summary fails to address specific questions that the original text can answer, such as whether Cognee supports NetworkX. However, there are no contradictions or extra information, indicating a moderate level of accuracy in the summarization. | 0.6